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ABSTRACT 

The GALAH survey is a large high-resolution spectroscopic survey using the newly 
commissioned HERMES spectrograph on the Anglo-Australian Telescope. The HER¬ 
MES spectrograph provides high-resolution (R ~28,000) spectra in four passbands 
for 392 stars simultaneously over a 2 degree field of view. The goal of the survey 
is to unravel the formation and evolutionary history of the Milky Way, using fossil 
remnants of ancient star formation events which have been disrupted and are now 
dispersed throughout the Galaxy. Ghemical tagging seeks to identify such dispersed 
remnants solely from their common and unique chemical signatures; these groups are 
unidentifiable from their spatial, photometric or kinematic properties. To carry out 
chemical tagging, the GALAH survey will acquire spectra for a million stars down 
to V ^14. The HERMES spectra of EGK stars contain absorption lines from 29 ele¬ 
ments including light proton-capture elements, a-elements, odd-Z elements, iron-peak 
elements and n-capture elements from the light and heavy s-process and the r-process. 
This paper describes the motivation and planned execution of the GALAH survey, 
and presents some results on the first-light performance of HERMES. 
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1 INTRODUCTION 

About half of the stars in the Milky Way are thought to 
have formed befor e an age corresponding to redshift z 
jLineweave il 1 199911 . We see ancient stars around us in the 
old thin disc, the thick disc, the stellar halo, the inner 
bulge, in globular clusters and in satellite dwarf galaxies. 
We are coming into a new era of Galactic investigation, in 
which one can study the fossil remnants from the earliest 
phases of the Galaxy’s formation throughout its major lu- 
mi nous components. The earliest galaxy formation scenario 

bv lEggen. Lvnden-Bell &: Sandara lll962l ~) suggested the dis- 
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sipative collapse of a large isolated protogalactic gas cloud, 
which eventually settled into a disc. The idea of galaxy for¬ 
mation through hierarchical aggregation of smaller elements 
from the e arly Universe later g ained support from observa¬ 
tiona l (e.g. Searle__fc__Zinn 197^ and theoretical le.g. lPeeblesI 
Il97ll : IPress fc Schechteif 1974) perspectives. Today we ac¬ 
knowledge that both processes of dissipative collapse and 
satellite accretion play a key role in the formation and evo¬ 
lution of galaxies. The influence of infall of small evolved 
satellite systems is not confined to the stellar halo: it may 
also affect the build-up of the disc and bulge. Such infall 
is prominent in Gold Dark Matte r (CDM) simulatio ns of 
galaxy formation (see for example I Abadi et al.l[200^ . and 
we need to consider the detection of the debris of such sys¬ 
tems in the context of the archaeology of the Galaxy’s major 
components. 
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Identifying the debris of smaller building blocks of the 
Galaxy offers the possibility to reconstruct at least some 
properties of the proto-galaxy and so to improve our basic 
understanding of the galaxy formation process. Th is idea 
was p resent in early studies of moving stellar groups (lEggenl 
I1974 1. and is now becoming a major field of research in theo¬ 
retical and observational astronomy. The goal of these stud¬ 
ies in Galactic archaeology is to unravel the early Galac¬ 
tic history by using the stellar relics of ancient in-situ star 
formation and accretion events, primarily focused on the 
building-up of the disc and bulge, which contain nearly all 
of the stellar mass of the Milky Way. 

That said, unraveling the history of disc formation is 
likely to be challenging as much of the dynamical informa¬ 
tion such as the integrals of motion is lost due to heating. We 
need to examine the detailed chemical abundance patterns 
in the disc components to reconstruct substructure of the 
protogalactic disc. Pioneering stud ies on the chemodynami - 
cal evolution of the Galactic disc bv lEdvanfgsoi 2 _et_ah (1993 ) 
followed by many other such works (e.g. Reddv et al.l 2003 ; 
iBensbv. Feltzing fc O(^l20l4l . show how trends in various 
chemical elements can be used to resolve disc structure and 
obtain information on the formation and evolution of the 
Galactic disc, e.g. the abundances of thick disc stars rela¬ 
tive to the thin disc. The effort to detect relics of ancient 
star formation and the progenitors of accretion events will 
require gathering kinematic and chemical composition infor- 
mat ion for vast numbers of Galac tic field stars. 

[Freeman fc Bland-HawthornI (l2002li proposed the 
method of chemical tagging in order to reconstruct some of 
the ancient star-forming aggregates in the disk. The goal 
of the Galactic Archaeology with HERMES (GALAHj^ 
survey is to acquire high-resolution spectra of about a 
million stars for chemical tagging, in order to investigate 
the assembly history of the Galaxy. This paper describes 
the scientific motivation for the GALAH survey, starting 
with some general points about the main components of the 
Galaxy in Section [2] In Section [3l we discuss some specifics 
of chemical tagging. Section [4] presents a summary of the 
HERMES spectrograph, which is the workhorse instrument 
of the GALAH survey. Section [5] provides an overview of 
the survey operations and in Section [6] we discuss survey 
scope and scientific opportunities. 


2 GALACTIC COMPONENTS 

In this section we present a brief overview of the major 
Galactic components in order to provide a context for the 
later discussion of the GALAH survey scope. 


2.1 The Halo and its Satellites 

The metal-poor stellar halo is a minor component in terms 
of the relative stellar mass content of the Galaxy. It hosts 
some of the oldest Galactic stellar populations and is dynam¬ 
ically and ch emically very different from the rest of the (stel¬ 
lar) Galaxy (lHelmill2008l L It is spatially very extended and 


^ http://www.mso.anu.edu.au/galah/home.html 


slowly rotating. The dynamical timescales are long and dy¬ 
namical interactions less severe than in the disc. Halo stars 
can therefore preserve their integrals of motion more readily 
and their dynamics are more likely to contain useful informa¬ 
tion about the assembly of the halo. For discussions on the 
state of the Galacti c halo based on Sloan Dig i tal Sky Sur¬ 
vey (S P SS) see e.g. Carollo et al.l 


201oli: Beers et aP 


ll201^: Fermani fc Schdnrichl (|2 013ll: iKafle et al.l ~i 2013ll : 
ISchonrich. Asplund fc Casagrandd I2014l'l. 

We suspect that a large fraction of the halo stars may 
be remnants of early satellite galaxies which experienced 
independent ch emical evolution before being accreted on 
to the Galaxy ilKirbv et al.ll201ll L The discovery of many 
dwarf satellites and s tellar streams including the S agittar- 
ius dwarf spheroidal (llbata. Gilmore fc Irwin [l994 ), is evi¬ 
dence that such processes are still underway I Zucker et al.l 
l2006lf . The stars in dwarf spheroidal galaxies generally have 
lower a-element abundances compared to the halo stars, 
except at very low metallicities ([Fe/H] < —1.5), where 
the a-element abundances overlap with those of halo stars 
llTolstov. Hill fc Tosil l2009l l . Identifying any disrupted pri¬ 
mordial counter-parts of the dwarf satellites currently orbit¬ 
ing the Galaxy, will serve as probes into the conditions of 
halo assembly as well as star formation in the early Universe. 

Globular clusters represent some of the oldest stellar 
populations in the Galaxy with characteristic photometric 
and light element ab undance peculiarities including the O - 
Na anti-correlation dGratton. Carretta fc Bragaglial I 2 OI 2 I '). 
Stars with such particular light-element abundance patterns 
are observed mostly wit hin globular clusters, but not gen¬ 
erally in the field (e.g. |GraTtqn__et_alJ 200(]|) or in Local 
Group dwarf galaxies fe.g. iTolstov. Hill fc Tosil 200^)Trhe 
small fraction of such stars in the field fe.g. ICarretta et al.l 
I 2 OI 0 I : iMartell et al.llioill : IWvlie-de Boer et all 2012h are in¬ 
terpreted as evidence of cluster mass loss and dissolution. 
Globular cluster formation and origin is a subject of active 
research and the detection of extra-tidal tails and field stars 
that can be chemically tagged back to globular clusters will 
provide significant new insights. 


2.2 Thin and Thick Discs 

The Galactic thin disc shows a mean r adial metallicity gra - 
dient of about —0.07 dex kpc“^ (e.g. ICheng et al.l [200 1. 
Near the Sun, the thin disc stars cover a wide range in age, 
with the oldest stars having ages up to about 10 Gyr. Their 
age-metallicity relation shows that stars of almost all ages 
up to about 10 Gyr cover a broad metallicity range (from 
about —0.5 to -1-0.5 dex), and only a weak trend of increas¬ 
ing metallicity with decreasing age is discernible. The cur¬ 
rent belief is that the most metal-rich thin disc stars near 
the Sun did not form near the Sun but rather formed in the 
more metal-rich inner Galax y and migrated radially out t o 
the solar neighbourhood (e.g. iGreno nlll999l: lHavwoodll2008l'l . 
Radial migration is a theoretical prediction that has gen¬ 
erated a great deal of interest recently. It is believed to be 
driven by the torques of the Galactic bar and transient spiral 
arms, which can move stars radially in wards and outwards 
from one near-circular orbit to another ilSellwood fc BinnevI 
l2002ll . without significantly heating the disc. We do not 
know how important radial mixing has been in determining 
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the current state of the Galactic disc. The GALAH survey 
data will be able to answer this question (see Section [6.411 . 

In addition to their defining thin discs, almost 
all spirals appear to hav e a thick disc component 
dYoachim fc DalcantorJl2006[ ). The Galactic thick di sc was 
originally identified throug h vertical star counts (lYoshiil 
1 19821 : iGilmore fc ReidIIl983l ) , and has since been shown to 
be generally older, more metal-poor and more a-enhanced 
than the thin disc, though there are also old metal-rich stars 
with thick-disc-like orbits (i.e. large scale heights and slower 
rotational velocity). Thus, it is an open question whether 
the stellar disc can form a thick component with time be¬ 
cause of internal (e.g., scattering, radial mixing) or external 
(accretion e vents, mergers) mechanis ms or is a continuous 
distribution (IBovv. Rix fc Hogsll2012l 'l. 

For large spirals like the Milky Way, the thick disc mass 
is typically about 10% of the thin disc in the solar neigh¬ 
bourhood, and its vertical scale height is about 1000 pc; 
the scale height of the thin disc is typically about 300 pc 
dCasagrande et aDl201lll . Stars of the thin and thick discs 
have different motions and different density distributions. 
For a recent assessm ent of disc kinematic parameters see 
ISharma et al 1 (l20li) . 

Near the Sun, although the thick disc is on average 
more metal poor, the metallicity ranges of the thick and 
thin disc stars overlap. The thin disc stars have [Fe/H] in 
the range —0.7 to -1-0.5, while most of the thick disc stars 
have [Fe/H] between about —1.0 and —0.3, with tails ex¬ 
tending to —2.0 and —0.1 dSchlesinger et all 120121 '). In the 
[Fe/H] interval over which the two disc components over¬ 
lap, the_tluck_disc_staxs_axedisfectly more a-enhanced (see 
e.g. lBensbv. Feltzing fc Oevl20lJl . Many fundamental ques¬ 
tions remain unanswered, such as the metal-poor limit of the 
thin disk, and the metal-rich limit of the thick disk. What 
is the radial and vertical extent of the thick disc and do 
they display any metallicity gradients? With a simple se¬ 
lection function and large sample size, the GALAH dataset 
will play a crucial role in answering these basic questions 
(see Section ESI). 


2.3 Bulge 

The bulge of our Galaxy is archaeologically interesting. 
Small boxy bulges like the bulge of the Milky Way are 
now not regarded as predominantly being the products of 
mergers. The Milky Way bulge is believed to have formed 
several Gyr ago via bar-for ming and bar-buckli ng instabil¬ 
ities of the early disc (e.g. I Athanassoulal 120051 ) . The disc 
forms an elongated bar structure at its center, which then 
buckles vertically and settles into the long-lived boxy shape. 
These instabilities of the disc redistribute the disc stars into 
the bulge. This picture is supported by comparisons of ra- 
dial velocity obs ervations to N-body model predictions (e.g. 
IShen et al.ll20ld L The different components of the early in¬ 
ner disc (thick disc, old thin disc, younger thin d isc) end up 
trapp ed dynamically within the bulge structure jNess et al.l 
I 2 OI 3 I L Their distribution within the bulge depends on their 
initial phase space distribution before the instability. We 
can see the relics of these Galactic components now in the 
m etallicity distribu tion function of the bulge stars (see Fig 1 
in iNess et al.llioi^ L These metallicity components are seen 
throughout the bulge, but their relative weights change with 


position in the bulge. In this way, we see a fossil image of 
the early inner Galaxy, mapped into the bulge. 

Although the bulge giants are mostly too faint (V > 14) 
for the GALAH survey, dedicated bulge programs using the 
HERMES instrument have the potential to contribute very 
signihcantly to understanding the chemical evolution of the 
bulge and inner disk. 


3 CHEMICAL TAGGING 

The idea of chemical tagging is to use the detailed chemi¬ 
cal abundances of individual stars to tag or associate them 
to common ancient star-forming aggregates (including dis¬ 
rupted satelhtes}_whose_stars_J]nwshnhar abundance pat¬ 
terns dFreeman fc Bland-Hawthorn|[200^ . After their orig¬ 
inating clusters or host galaxies disperse, these stars may 
lose much of their dynamical information, such as the in¬ 
tegrals of motion, because of heating and radial migration. 
The stellar abundance pattern over many elements likely re¬ 
flects the unique chemical state of the gas from which the 
aggregate formed, analogous to a star’s DNA prohle. 


3.1 Open clusters 

A vital condition for the chemical tagging process 
is that star clusters be chemically homogeneous, 
and their abundance distributions be sufficient l y dif¬ 
ferent from cluster to cluster Silva et ni l2007l : 
IPe Silva. Freeman fc Bland-Hawthorn 2009l L Almost all 


surviving open star clusters are chemically homogeneous at 
the level of the measurement uncertainties (~0.05 dex) for 
heavier elements and often also for the lighter elements in 
stars whose surface abundances are not affected by stellar 
evolutionary effe cts such as dredge-up or deep mixing (see 
iFriel et ^ (|2014| ~) for a recent example). Careful differential 
analyses of high-resolution data with high signal-to-noise, 
which removes much of the line-to-line systematics, show 
abundance scatter at the level of ~0.02 dex, where the source 
of chemical inhomogeneity in open clusters could include 
effect s of planet formation (lRanmeZj_^elendez_&_^s£huM 


200£) and/or atomic diffusion llOnehag. Gustafsson fc Korn 


201411 . Such levels of scatter may be the limit at which we 


can chemically resolve individual star formation events. 

Theoretically, a high level of chemical homogene¬ 
ity is expected even within regions of low star forma¬ 
tion efficiency that do not produce bound open clus¬ 
ters dFeng fc Krumhoi3 l2014| f . Dynamically, open clusters 
that remain bound over a Gyr may be anomalous disc 
objects. Star clusters that disperse within the molecu¬ 
lar clouds may have been the major contributors to the 
building of the Galactic disc (iLada fc Ladal l2003lf . En- 


hanced [Na/Fe] in old 
in the literature (e.g 

open cluster stars have been rei 

sorted 

Jacobson. Friel & Pilachowskil 

I2OO7I; 

ISchuler. King & Thel 

2009|f whereas solar level Na 

abun- 


dances are reported in held star studies. This result ques¬ 
tioned whether old open clusters are chemically different to 
the star formation events that contributed to the Galactic 
disc. However, this discrepancy in Na abundances is likely to 
be a result of analysis method, where departures from local 
thermodynamic equilibrium were not taken into account in 
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earlie r open cluster studies jMacLean, De Silva fc Lattanzid 
l2015h . 


3.2 Moving groups 

The Galactic disc near the Sun shows some kinematic over¬ 
densities or substructure. These are sets of stars that have 
some degree of common motion, and they are broadly re¬ 
ferred to as stellar moving groups. The stars of the moving 
groups are all around us: they are seen as concentrations in 
velocity but not in position. Some of these groups are the 
debris of disrupted old star clusters in the disc. Examples 
include the HR 1614 , Wolf 630 and Argus moving groups 
jPeSilva et al.l l2007l : iBubar fc Kind 120101 : IPe Silva et al.l 
I 2 OI 3 II . These groups are now dispersed into extended regions 
of the Galaxy, and their stars are chemically homogeneous 
to within 0.05 dex and have common ages. The HR 1614 and 
Wolf 630 groups are about 2 Gyr old but still retain some 
kinematic identity. We can expect to find many older dis¬ 
rupted groups which have lost all their kinematic identity, 
distinguished by the fact that they preserve their chemical 
identity. These are the relics that trace the star formation 
history of the Galaxy, and are of great interest for Galactic 
archaeology. 

Other moving groups, such as the Hercules group 
llBensbv et al.ll2007l 'l. represent dynamical resonances asso¬ 
ciated with the pattern speed of the Galactic bar or spi¬ 
ral structure. The Hercules group members are a chem¬ 
ically typical sample of the nearby disc, with a wide 
range of chemical abundances. These resonance groups 
are dynamically interesting but are of limited interest 
for Galactic archaeology. Further chemical information 
shows that a co-moving group of stars can have multi¬ 
ple origins, such as the Hyades Supercluster. In this case 
some of the dispersed stars share identical abundances 
to that of the Hyades open cluster, while the majority 
of the co-mov ing stars pres e nt th e field star abundance 
distributions _(lde_SiIv^_et_alJ_l201]J;_lPonir )eia^_etal. I I 2 OIII: 


iTabernero. Montes fc Gonzalez Hernandea I 2 OI 2 I '). showing 
that the Hyades Supercluster has a combined dissipative 
and dynamical origin. 


clouds. Groups like the HR 1614 group are rare examples of 
dispersed clusters which cannot be identified spatially but 
are still identifiable both chemically and kinematically. Most 
older dispersed aggregates would now not be recognisable 
dynamically, and chemical techniques provide the only way 
to identify their debris. 


3.4 C-space 

The process of chemical tagging operates within a multi¬ 
dimensional space (G-space) whose axes are the measured 
relative abundances of the stars being studied. The C-space 
co-ordinates are [Xi/H], [X 2 /H], ... [X„/H]. From HERMES 
spectra of GALAH targets, abundances for up to 29 ele¬ 
ments per star will be measurable. Not all of these elements 
vary independently from star to star; many vary together 
in near-lockstep. The number of independent dimensions of 
this chemical space appears to be between 7 to 9, possibly 
depending on the domina nt nucleosyntheti c processes in a 
given Galactic population dTing et al.ll201 j l. 

Stars that originate from chemically homogeneous ag¬ 
gregates like dispersed clusters will lie in tight groups in 
C-space. Stars which were accreted from dwarf galaxies will 
lie along chemical sequences that reflect the unique history 
of star formation and chemical enrichment in those galax¬ 
ies. We expect these sequences will be distinct in C-space 
from the stars of the Galactic thin and thick discs. We ex¬ 
pand the discussion on the dimensionality of C-space and 
required sample size for chemical tagging in Section |S] in 
relation to the scope of the GALAH survey. 

With this chemical tagging approach, we expect that 
it will be possible to reconstruct old dispersed stellar ag¬ 
gregates in the Galactic disc, derive their ages and their 
kinematic and metallicity distributions, and so build up a 
picture of the star formation history and the dynamical and 
chemical evolution of the Galactic disc since the disc be¬ 
gan to assemble. This kind of chemical tagging experiment 
needs a high-resolution spectroscopic survey of about a mil¬ 
lion stars, homogeneously observed and analysed - this is 
the GALAH survey. It is the prime science driver for the 
HERMES instrument. 


3.3 In-falling satellites 

Some moving groups may be the debris of in-falling 
dwarf galaxies that were tidally disrupted in the pro¬ 
cess of being accreted by the Milky Way. The dis- 
rupting Sagittarius dwarf ga laxy is a familiar example 
(llbata. Gilmore fc IrwinI Il994ll . The Kapteyn group is an¬ 
other example, as a possible remnant of the tidal debris 
of the unusual globular cluster Omega Gentauri during a 
merger event that deposited Omega Gen into the Milky Way 
dWvlie-de Boer. Freeman fc Williamsll201(il : lMaiewski et al.l 
l2012h . One of the GALAH survey goals is to identify such 
debris using chemical techniques, even if the debris has lost 
its kinematic identity, because this provides a way to make 
a direct estimate of the accretion history of the Milky Way. 

In summary, although the disc does show some surviv¬ 
ing kinematic substructure in the form of moving stellar 
groups, we can expect that much of the dynamical infor¬ 
mation was lost in the subsequent heating and radial mix¬ 
ing by spiral arms and interactions with giant molecular 


4 THE HERMES SPECTROGRAPH 

A million star survey as discussed in this paper is only pos¬ 
sible with a spectroscopic facility combining high resolution 
and high multiplex over a large field of view and sufficient 
spectral coverage in a single exposure. This facility is the 
High Efficiency and Resolution Multi-Element Spectrograph 
(HERMES) recently commissioned on the AAT. It is cur¬ 
rently unrivalled as the only optical spectrograph with this 
combination of capabilities. In this section we briefly de¬ 
scribe the 2dF-|-HERMES design and performance. For full 
det ails of HERMES fun ctionality and commissioning results 
see ISheinis et al.l (l2015lj . 

HERMES receives light from the existing 2-degree Field 
(2dF) fibre positioning system, mounted at the prime focus 
of the AAT, which collects light into 400 fibre s over a circular 
field 2 degrees in diameter (iLewis et al.ll2003l . Eight of these 
fibres are fiducial fibres used to align and guide the observing 
field on user provided catalogue stars. The remaining 392 
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fibres are available for science use, of which at least 25 are 
recommended for sky background observations. The fibre 
diameter is 2.1 arcsec and the robotic positioner has a fibre 
positioning accuracy of 0.3 arcsec. 

The two-prism atmospheric dispersion corrector (ADC) 
at the prime focus of the AAT automatically counteracts 
the colour spread in each star image due to atmospheric 
dispersion. There are also positional errors due to differen¬ 
tial atmospheric refraction across the 2dF field during long 
exposures: these are minimised by the customised 2dF fibre 
configuration software and by observing close to the merid¬ 
ian. Third, due to its original design requiring a flat field 
and telecentric input beams, there is systematic chromatic 
variation in distortion (CVD) across the 2dF field. The ef¬ 
fect is negligible at the center of the field, where the light 
from different wavelengths coincides. At larger radii on the 
2dF plate, the light is spread radially up to a maximum of 
about 0.6 arcsec bet ween the centroids at 5250A and 8500A 
dCannon et al.|[2008l i. Towards the edges of the field the ra¬ 
dial shifts decline but the images become more diffuse. Thus 
there are three colour-dependent effects that all contribute 
to the fibre positioning accuracy and hence to the effective 
throughput of 2dF plus HERMES. The first is largely cor¬ 
rected by the ADC, the second is minimised by the config¬ 
uration software and careful planning of the observations, 
but the third is a feature of the system which constrains the 
performance across wide wavelength ranges. 

The fibre cable of almost 50 meters in length runs 
from the telescope prime focus into a stable, temperature- 
controlled room inside the AAT West Coude laboratory 
where the HERMES spectrograph is located. The 2dF posi¬ 
tioning system contains two observation plates, which al¬ 
lows the fibre configuration to be updated on one plate 
while the other plate is collecting data. Correspondingly, 
the HERMES slit assembly holds two interchangeable slit 
units, which provide an accurate and stable interface for the 
two fibre feeds coming from 2dF. 

Post slit, HERMES has a single off-axis collimator and 
two off-axis corrector lenses, four VPH (Volume Phase Holo¬ 
graphic) gratings and cameras and three dichroic beam split¬ 
ters. Each camera feeds one 4096x4112 15/rm pixel CCD 
from the E2V CCD231-84 family. The Blue and Green chan¬ 
nel detectors are both 16 micron standard silicon devices. 
The Red channel detector is a 40 micron, deep depletion 
device and the ”IR” channel detector is a 100 micron bulk 
silicon device. Both Red and IR CCDs have fringe suppres¬ 
sion coatings. Each detector is independently controlled and 
permits readout from one, two or four readout amplifiers, at 
various clock rates with windowing and binning options. 

HERMES provides fixed format spectra in four non¬ 
contiguous wavelength bands covering a total of about 
lOOOA between 4713A and 7887A. HERMES is designed to 
operate at two resolution modes, with a resolving power of 
R ~ 28,000 being the nominal resolution mode and a higher 
resolving power of R ~ 42,000 achieved by manually insert¬ 
ing a mask on kinematic mounts on the slit assembly, at 
the cost of 50% light loss. The standard HERMES setup 
for GALAH observations uses the nominal resolution mode 
with two amplifiers in fast readout (4e readout noise) for 
all CGDs. Fu rther details o n the HERMES design and per - 
formance see iBarden et al.l ll20ldi: iHeiimans et al.l (120121): 

I Shemisetal. imr ^ -^ 


5 SURVEY OBSERVATIONS 
5.1 Target selection 

The GALAH survey plans to observe ~1,000,000 stars, using 
AAT-I-HERMES in the nominal resolution mode. Our pri¬ 
ority is to keep the GALAH selection criteria as simple as 
possible, with a baseline selection being a magnitude-limited 
sample corresponding to 12<V<14 (with some excursions 
brighter and fainter for special targets) with |6| > 10 de¬ 
grees and declination between —80° < <5 < -|-10°. We use 
the Two Micron All Sky Survey (2MASSj3 JHK photome¬ 
try dSkrutskie et al.ll2006l 'l to estimate V(J,K) to define our 
target selection criteria, where 

V{J, K) = K + 2{J-K + 0.14) -b o.382e“'^-^"°-^^''“®°^. 

The AAVSO Photometric All-Sky Survey (A PAsfl 
Henden_et_ajJ (120121 ): iHenden fc Munao (l20lJ) : 

Munari et al.l (l2014l) ) optical photometry as available, 
is used to estimate the stellar magnitudes in the HERMES 
bands to check throughput estimates. These, together 
with 2MASS photometry act as photometric priors for the 
subsequent sp ectroscopic analysis, e. g. using the Infrared 
Flux Method dCasagrande et ^120101 ). 

The above mentioned selection criteria is used to choose 
fields with target densities > 400 stars per 2dF field to allow 
optimal allocation of fibres. Using an adaptive tiling strat- 
egj0 , the input catalogue is divided into 4303 field centers 
which in turn yield 6546 independent ’configurations’ con¬ 
taining 400 unique stars. Of these, 3300 configurations need 
to be observed in order to reach the target number of stars 
(see Figure [1]). 

The stars in such a magnitude-limited survey will have 
a double-peaked temperature distribution, with one peak 
dominated by stars near the main sequence turnoff and the 
other by clump giants (see Figure [2J. Tabled provides an 
estimate of the expected contribution from the giants and 
dwarfs from each of the main Galacti c components, based 
on Galaxia models (ISharma et al ■|2011|) . The old disc dwarfs 
are seen out to distances of about 1 kpc, the clump giants 
to about 5 kpc, and the brightest halo giants to about 15 
kpc. 

The ObsManager tool developed within the GALAH 
collaboration interfaces with the GALAH input catalogue 
to provide observers with 2dF configuration files appropri¬ 
ate to a particular date and time. For each field center in our 
observing plan, ObsManager will select as potential target 
stars only those with no close neighbours. For this selection 
we used the 2MASS prox parameter, which is the distance 
between source and nearest neighbour up to the catalogue 
magnitude limit, where we set a distance limit of prox > 6 
arcsec for all stars and distance greater than 130 — (10 x V) 
arcsec for brighter stars. We use the Tycho and Bright Star 
Catalogs, with larger exclusion zones for brighter stars. Ob¬ 
sManager also ensures a distance of at least 30 degrees 
from the Moon, and avoids fields containing Solar System 
planets to prevent the contamination across large areas of 


^ http://www.ipac.caltech.edu/2mass/ 

® http://www.aavso.org/apass 

Tiling refers to the layout of 2-degree fields on sky. Details of 
the tiling strategy are given in Sharma et al. (in prep). 
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Figure 1. The distribution of the GALAH observing fields. There are 4303 field centers shown as grey open circles, of which 3300 need 
to be observed to complete the survey. The magenta circles show the observations completed as at November 2014. 


Table 1. Percentage of contribution to the GALAH sample from 
Galactic components. 


Component 

Dwarf 

Giant 

Thin disc 

57.87 

18.56 

Thick disc 

4.63 

17.82 

Halo 

0.05 

1.02 

Bulge 

0.00 

0.05 


the observing field. Since the astrometry in our catalog is 
taken from 2MASS, ObsManager includes the positions of 
targets along with th eir proper motions from the UCAC4 
llZacharias et al.ll2013h catalogue when writing out 2dF con¬ 
figuration files. 

Upon completion of observations (see section [5^ . Ob- 
sManager updates the input catalogue to prevent realloca¬ 
tion of observed targets in future field configurations. Tar¬ 
gets marked for repeat observations, individual targets (such 
as calibration standard stars) and high priority targets (such 
as synergistic fields, see section lOl) are also handled within 
ObsManager, to optimally integrate with regular GALAH 
field configurations. A detailed description of the ObsMan¬ 
ager software is presented in Sharma et al. (in prep). 

5.2 Baseline data flow 

The baseline GALAH observing strategy is to observe each 
configuration for three consecutive twenty-minute exposures 
within an hour of transit. In practice, the total exposure time 
is dependent on the seeing, with an additional 20 minutes 
for seeing larger than 2 arcsec and an additional 60 min¬ 
utes for seeing larger than 2.5 arcsec necessary to meet our 
S/N requirement of 100 per resolution element. When seeing 
conditions are worse than 3.0 arcsec, observation of regular 
GALAH fields are halted, although bright calibration obser - 
vations, such as Gaia benchmarks stars (Ijofre et al.l 1201^ 1 
can be carried out if clear skies prevail. 

Our observing strategy also includes repeat observa¬ 
tions of 5% of GALAH targets. Repeat observations allow us 
to monitor the quality of the collected data and the reliabil¬ 
ity of the data reduction, to determine the internal errors in 
our data-analysis pipeline, and to identify variable sources 
in our sample. 

GALAH standard calibration observations consist of 


one UV-Quartz fiat-field lamp exposure and one ThXe arc 
lamp exposure per 2dF configuration, taken immediately be¬ 
fore or after the science observations to ensure a similar tele¬ 
scope position as the target exposure. A set of 15 bias frames 
are taken every afternoon and dark frames are taken inter¬ 
mittently, typically during bad weather. Twilight sky fiats 
and dome fiats improve the fiat-fielding accuracy and are 
taken at all available opportunities. 

The data reduction and analysis is managed by the 
GALAH Analysis Pipeline (GAP) developed by the GALAH 
collaboration. The main stages of GAP consists of a series of 
modules that perform the required data reduction, quality 
control, chemical abundance determination and data prod¬ 
uct delivery. 

As new data becomes available, GAP makes use of the 
2 dfdiQ automatic data reduction pipeline dedicated to re¬ 
ducing multi-fibre spectroscopy data, as well as an IRAF0- 
based reduction tool developed within the GALAH collab¬ 
oration with pre-existing IRAF routines, adapted to reduce 
HERMES multi-fibre spectra. The dual reduction outputs 
are maintained from this point onwards to check the reduc¬ 
tion quality and to aid in the on-going improvements to the 
reduction procedures. 

Following data reduction, quality checks including the 
resolution and signal-to-noise ratio (SNR) is determined for 
each individual spectrum. Rapidly rotating stars and un¬ 
usual stars like spectroscopic binaries and chromospherically 
active stars are identi fied using the local lin ear embedding 
techniques such as bv iMatiievic et ^ (l2012li and set aside. 
Fields with a sufficient number of stars that meet the data 
quality requirements are considered to be ’completed’ and 
removed from the input catalogue by ObsManager. 

For all stars that pass the quality checks, the spectra 
from the four wavelength bands are processed through the 
abundance analysis module. In summary, this module carries 
out the radial velocity and continuum normalisation of the 
spectra, determines the stellar parameters based on spec¬ 
troscopic techniques and available photometric priors, and 
finally calculates individual elemental abundances via spec- 


5 WWW. aao. gov.au/science/software/2dfdr 

® IRAF is distributed by the National Optical Astronomy Obser¬ 
vatories, which are operated by the Association of Universities for 
Research in Astronomy, Inc., under cooperative agreement with 
the National Science Foundation. 


© 2015 RAS, MNRAS OOO.ITHTgI 















The GALAH Survey Motivation 7 



7000 6000 5000 4000 7000 6000 5000 4000 7000 6000 5000 4000 7000 6000 5000 4000 

Te« K K K T,, K 


Figure 2. The effective temp erature vs. magnitude diagram of different galactic populations from a synthetic GALAH survey, based on 
Galaxia llSharma et 


tral synthesis. The choice of atomic line data and model 
atmospheres are user provided inputs. Detailed information 
on the analysis pipeline process is given in Wylie de Boer et 
al. (in prep). 

5.3 GALAH spectra 

The HERMES throughput requirement for GALAH science 
is to obtain a minimum SNR of 100 per resolution element 
(where the resolution element is approximately 4x4 pixels) 
for a 14*^ magnitude star in the corresponding channel cen¬ 
tral wavelength within one hour exposure time. Note the 
nearest Johnson-Cousins filters for the HERMES channels 
are B,V,R and I. Based on GALAH data, Table [5] presents 
the limiting magnitude to achieve the required signal-to- 
noise in one hour exposure under seeing conditions of 1.5 arc- 
sec for each HERMES channel. With the instrument achiev¬ 
ing the required throughput, we expect the bulk of the sur¬ 
vey data (12<V<14) to meet the signal-to-noise require¬ 
ments. Note that for a given star, the signal-to-noise across 
the four channels will depend on the stellar type; a hotter 
star will have more signal in the bluer wavelength channels 
compared to a cooler star of the same V magnitude. 

The achieved spectral resolution in the GALAH data for 
each HERMES channel is R ~ 28,000. The resolution was 
measured by fitting gaussian profiles to ThXe calibration arc 
lamp exposures. The typical values measured from the cen¬ 
tral fibre are presented in Figure [S] However the resolving 
power varies between fibres and between the four channels. 
The combined range of variations in resolving power are rep¬ 
resented by the error bars. 


Table 2. Magnitude limit to obtain signal-to-noise of 100 per 
resolution element in 1 hour exposure 


Channel 

'^min (^) 

-^max ( 4 ^) 

Magnitude 

Blue 

4713 

4903 

B = 14.2 

Green 

5648 

5873 

V = 13.8 

Red 

6478 

6737 

R = 14.0 

IR 

7585 

7887 

I = 13.8 



Pixels 


Figure 3. Typical resolving power measured from the central 
fibre against spectral pixel position measured from a typical 
GALAH observational field. The error bars represent the range 
in resolving power between the tour channels and between fibres. 
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Figure 4. A comparison of HERMES spectra from the Green 
channel in nominal resolution mode for the Hyades dwarf 
HIP13976 against those of the high-resolution spectrographs in 
8-lOm class telescopes. Note the resolution of UVES and HIRES 
spectra are R ~60,000. The SNR is approximately 150, 250 and 80 
per pixel for UVES, HIRES and HERMES spectra respectively. 

HERMES commissioning observations included sev¬ 
eral Hyades open cluster memb ers that also have spec¬ 
tra obtained with VLT-I-UVES dDekker et al.l l200d) and 
Keck-I-HIRES llVoet et al.lll99J) available from those facil¬ 
ities’ archives. Note the resolution and SNR of the HIRES 
and UVES spectra are much higher than that of the HER¬ 
MES spectrum. Both the HIRES and UVES spectra are of 
resolution R ^60,000, compared to R ^28,000 for HERMES. 
The HIRES spectrum has a SNR ~250 per pixel, the UVES 
spectrum has a SNR ~150 per pixel and the HERMES spec¬ 
trum has a SNR ~80 per pixel. The effect of resolution in 
the observations is shown visually in Figure |4] where we plot 
a region of the HERMES Green channel for the Hyades star 
HIP13976. 

The GALAH data sample will provide accurate radial 
velocities as HERMES is housed in a clean, temperature- 
controlled room and mounted on vibration isolators to main¬ 
tain stability. A set of 5 exposures were taken with HERMES 
over 6 nights for the velocity-stable star, HD 1581, which 
has radial velocity variations of 1.26 ms“^ over a time span 
of 25 66 days as confirmed by HARPS spectra (jPepe et al.l 
[2 oT3 ). For each epoch, radial velocities were determined by 
Fourier transform cross correlation against the first epoch of 
observations, making use of the IRAF fxcor packages. 

We find the measured radial velocity varies between 
epochs at the level of about 300 ms“^, which is also the cur¬ 
rently achieved wavelength calibration accuracy. The larger 
uncertainty in wavelength calibration is primarily due to 
lack of accurately known wavelengths for the Xe lines, in 
the ThXe calibration lamp used in HERMES. We expect 
that the wavelength calibration will increase accuracy by 
the use of a ThAr calibration lamp together with the ThXe. 
The expected wavelength accuracy with the ThAr and ThXe 
combination, is of the order 0.002A. With further refined 
wavelength calibration or differential radial velocity deter¬ 
minations, HERMES will be capable of achieving radial ve¬ 
locity stability at the level of 100 ms“^. 

HERMES spectra contain absorption lines from about 
29 elements from the major element groups and nucleosyn- 



RV [km/s] RV [krn/s] RV [km/s] 

Figure 5. Averaged line profiles for several representative ele¬ 
ments In the Gala benchmark star 0 Hyl (Tefj =5873K, log g 
=3.98, [Fe/H] = -0.07, radial velocity = 23.16 kms"^) observed 
with HERMES In nominal resolution mode. 

thetic processes, including light proton-capture elements Li, 
C, O; a-elements Mg, Si, Ca, Ti; odd-Z elements Na, Al, 
K; iron-peak elements Sc, V, Cr, Mn, Fe, Co, Ni, Cu, Zn; 
light and heavy slow neutron capture elements Rb, Sr, Y, 
Zr, Ba, La; and rapid neutron capture elements Ru, Ce, Nd 
and Eu. Figure [S] plots averages of normalized spectral trac¬ 
ings around lines of selected elements in radial velocity space 
(after shifting to zero velocity) in the Gaia benchmark star 
j3 Hyi observed by GALAH. In each panel, the species and 
the number of lines considered are given at the top. 

To make an initial assessment of the abundance accu¬ 
racy achievable from GALAH quality spectra, equivalent 
widths (EWs) were measured from the HERMES solar spec¬ 
tra for 54 Fe I lines and 5 Fe If lines compiled by the GALAH 
abundance analysis group. Note that the simple analysis of 
the solar spectrum presented below aims only to demon¬ 
strate the quality of GALAH data. It is not representative 
of the GALAH pipeline analysis that will carry out a refined 
solar analysis and will be used to handle a large variety of 
spectral types. 

The measured EWs were used to derive spectroscopic 
parameters for the so lar spec t ra us i ng the 2012 versio n 
of the MOOG code dSnedenl Il973l : ISobeck et al.l I 2 OIII ). 
Abundances were generated for a grid of interpolated 
Kurucz model atmospheres base d on the ATLAS9 code 
dCastelli. Gratton fc Kurucdll99m with no convective over¬ 
shooting, with solar composition spanning 3500K < Tgfr < 
8000K in steps of 50K, 0.5 <log g < 5.0 in steps of 0.1 and 
0.5 < micro-turbulence < 2.5 in steps of 0.2. The global 
minimum over the resulting distribution of Fe I abundance 
vs. excitation potential and ionisation balance, correspond¬ 
ing to the HERMES solar parameters were found to be Tefr 
= 5750K, log g — 4.5 and micro-turbulence = 0.75, where 
the slope in abundance vs. excitation potential was = -0.003 
dex/eV, slope in reduced width = 0.000 and log e (Fe I) = 
log e (Fe H) = 7.57 ± 0.05 dex. Note the solar abundance 
of Fe varies in the literature, depending on factors including 
the choice of atomic line data and model atmospheres, the 
consideration of departures from local thermodynamic equi¬ 
librium (LTE) and/or 3D hydrodynamical calculations, and 
the resulting solar parameters. The most commonly adopted 
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reference solar Fe abundance based on realistic 3D photo- 
spheric mod els and non-LTE an alysis is log e (Fe I) = 7.50 
± 0.04 dex (lAsplund et al.l[^009l l. 


5.4 Calibration and Synergistic observations 


We need to observe a number of calibration objects to 
ensure that we can transform the system of the GALAH 
data products to the systems of other observers. Experi¬ 
ence from previous surveys has shown how important it is 
to cover the parameter space of the stars as completely as 
possible. For cross calibration with othe r major surveys, 
such as APOGEE (IZas owski et al.l l20 13fl a nd Gaia-ESO 
dRandich. Gilmore fc Gaia-ESO Consortium! l2013l l. we will 
observe a number of fields observed by these surveys, con¬ 
strained by our limiting magnitude. 

For radial velocity calibration, we plan to observe a wide 
selection of stars from the Gaia radial velocity standards list. 
Additionally, the Red and IR channel data contain many 
intrinsically narrow telluric lines (O 2 and H 2 O) that act 
as zero velocity absorption lines. Such zero-point calibra¬ 
tion has been succ essfully applied in high accuracy radial 
velocity work (e.g. iMunari et al.l 1^0 il l. For stellar param¬ 
eter calibration, we use a grid of well measured individual 
stars including the Gaia benchmark stars, covering a range 
of temperature, gravity and metallicity, as well as a set of 
well observed open and globular clusters for further param¬ 
eter validation. _ 

The CoRoT space mission (lAuvergne et al. 20091') an d 
the extended Kepler mission, K2 I Howell et al.l 2014h . 
provide key opportunities for galactic archaeology with 
GALAH. The asteroseismology coming from these missions 
are very well-matched with the spectroscopy from GALAH. 
From the asteroseismology of solar-like oscillations in red gi¬ 
ants, stellar radii a nd masses will be measurable to a few per¬ 
cent accuracy le.g. lStello et al.|l2009l : lGai et al.ll201ll L These 
accurate stellar masses lead to stellar ages with precisions of 
better than 15-20%. However, this is only achievable when 
accurate stellar temperatures and metallicities are available 
from spectroscopic surveys such as GALAH. Simultaneously, 
the seismically determined log g values, w hich typically have 
uncertainties of only 0.01-0.03 dex (e.g. Gasagrande et aH 
I 2 OI 4 I : IChaplin et alj l2014l : IPinsonneault et al.l 2014l l. will 
serve as calibration for the spectral analysis of GALAH data. 

Stellar ages are notoriously difficult to det ermine, es¬ 
pecia lly for giants that probe greater distances llSoderblomI 
l2010h . Having such unprecedented age information from seis¬ 
mology in different galactic locations gives uni que insight to 
the f ormation history of the Milky Way (e.g. iMiglio et al.l 
l2013l 'l. With accurate ages, metallicities and velocities, 
these stars provide excellent opportunity to study the age- 
metallicity-velocity relation of the thin disc. Indeed, almost 
40% of the K2 stars to be observed so far have been proposed 
within the GALAH collaboration, and GALAH is on track 
to assemble spectra of order 40,000 K2 targets distributed 
across all components of the Galaxy during K2’s initial 2- 
year mission ending mid 2016. GALAH will observe two well- 
studied GoRoT fields (currently over 2000 stars observed), 
that are also observed by the APOGEE survey, which has al¬ 
ready demonstrated the potential for s eismic-spectroscopic 
collab oration in the original Kepler field llPinsonneault et al.l 
I 2 OI 4 I I. The APOGEE-GALAH overlap will play an impor¬ 


tant role in cross-calibration of the optically based GALAH 
and infrared-derived APOGEE abundance scales. 

We highlight the complementarity of Gaia 
dPerryman et aLfbOOll ') and GALAH. From about 2017, 
Gaia will provide precision astrometry for the GALAH 
stars. At the bright magnitudes of the GALAH sample, 
Gaia will be at its best, with parallax and proper motion 
uncertainties about 25 fias and 15 fias yr“^ respectively 
at the end of the Gaia survey. These correspond to ~2% 
distance errors at a distance of 1 kpc and ~1 kms“^ 
errors in transverse velocity at 15 kpc. Combined with 
GALAH radial velocities (accurate to within 0.5 kms~^), 
the outcome will be very accurate absolute magnitudes 
and 3D velocities for the whole GALAH sample. This 
provides 6 D phase-space coordinates, and enables the study 
of Galactic orbits and potential. Further, GALAH and 
Gaia combined provides the ability to construct accurate 
colour-magnitude diagrams for all relic systems that are 
recovered by chemical tagging. This will be yery valuable 
for estimating ages and as an independent check that the 
stars of recovered chemically homogeneous groups do have 
common ages. 


6 SCOPE OF THE GALAH SURVEY 


6.1 Survey simulations 

The primary motivation for the GALAH survey is the 
chemical tagging experiment described in Section [3] Our 
goal Is to identify debris of disrupted clusters and dwarf 
galaxies. About 60% of the GALAH stars will be dwarfs 
of the thin and thick disc (Table [TJ. The GALAH hori¬ 
zon for these stars is about 1 kpc. We assume here 
that all of the disrupted objects whose orbits pass 
through a ±1 kpc-wide annulus around the Galaxy at 
the solar circle are represented within the observable 
horizon. Simulations ll Bland-Hawthorn fc Free manI l2004l : 


iBland-Hawthorn. Krumholz fc Freeman 2010h show that a 
random sample of a million stars with V < 14 will allow 
detection of about 20 thick disc dwarfs from each of about 
3000 star formation sites, and about 10 thin disc dwarfs from 
each of about 30,000 star formation sites. Is it possible to de¬ 
tect the debris of about 30,000 different star formation sites, 
using chemical tagging techniques? Are there enough inde¬ 
pendent cells in G-space to make this possible? The GALAH 
survey data will test these expectations. 

The simulated numbers above depend on the details of 
the initial cluster mass function (ICMF) of the disrupted 
objects and its mass range. Lets assume the ICMF to be a 
power law specified by minimum mass Xmin, maximum mass 
Xmax and power law index 7 . Our expectation is that the 
thin disc has a steep ICMF (large 7 ) and low Xmax typical 
of a quiescent star formation history and that the thick disc 
will have a shallower ICMF (small 7 ) and large Xmax rep¬ 
resentative of the turbulent high-pressure discs seen at high 
redshift. The size of the GALAH survey is selected to probe 
the slope 7 and Xmax of the ICMF. 

We simulate the GA LAH survey based on the Galaxia 
tool (ISharma et al.llioill l and then applied a simple ana¬ 
lytical prescription for generating stellar clusters. Figure [S] 
shows the number of unique clusters identified in this sim¬ 
ulated survey as a function of initial cluster mass. Setting 


© 2015 RAS, MNRAS OOO.fTHTel 
























































10 De Silva, Freeman, Bland-Hawthorn et al. 


the minimum requirement of ten member stars for a reli¬ 
able identification of a cluster, we calculate the threshold 
cluster mass (i.e. the lowest-mass cluster we expect to re¬ 
cover ten stars from) as mr/i =10 x (Mpop x fmix)/(fpop x 
N), where Mpop is the total mass of the galactic population 
(thin or thick disc), N is the number of stars in the survey 
and fpop is the fraction of stars in the survey that belong 
to the population. The fmix is the fraction of star forming 
mass of the population that lies within the survey volume. 
If there is no mixing /mix = Mp^p/Mpop, with M'op be¬ 
ing the physical mass of the population enclosed within the 
survey volume. If mixing is maximal, i.e., stars born any¬ 
where within the Galaxy can lie in the survey volume, then 
/mix = 1. For the thick disc, Mpop =3.9 x 10® Mq and 
fpop=0.236, which means that mT;i=4.2 x 10^ for N=10® 
and we assume fmix=0.25. Clusters with initial masses be¬ 
low 4.2 X 10"^ are outside the detection limits of the survey 
as noted by the green shaded region. Less efficient radial 
mixing brings stars from a smaller number of clusters into 
our survey volume, moving all of these thresholds toward 
lower mass and making cluster identification easier. 

The red dots in Figure [6] show the cluster masses from 
which we would expect to recover 20 stars (points on the 
left) and 40 stars (points on the right) in a million-star sur¬ 
vey, and the number of such clusters we would expect to find 
if Xmax is 2 X10® (points on the blue curve) or 1 x 10® (points 
on the green curve). The red error bars show 2a Poisson un¬ 
certainty on the number and on the size of the recovered 
groups. From this simulation we see that a smaller survey 
size would mean fewer stars per formation site, from a sim¬ 
ilar number of formation sites and severely limits the range 
of cluster masses over which we can explore the ICMF. 

The above calculations assume that clustering exists in 
chemical abundance space and that the clusters are well sep¬ 
arated and observational errors are small enough (< 0.1 dex) 
such that they can be detected by clustering algorithms. In 
reality, the detectability of clusters in C-space depends upon 
the dimensionality of the C-space, the inter-cluster separa¬ 
tion and the observational uncertainties on abundance mea¬ 
surements. These questions can only be answered with a 
large enough data set that has been homogeneously anal¬ 
ysed, such as the GALAH survey data. 


6.2 The dimensionality of the GALAH chemical 
space 

Within the chemical inventory that will be made available 
from the GALAH survey data, the effectiveness of chemical 
taggi ng is not driven simp ly by the number of elements stud¬ 
ied, jRamfrez et al.l|2014h : its power derives from studying 
elements w ith independent o rigins, which vary relative to 
each other. iTing et al.l ll2012l ') made a principal component 
analysis (PCA) of element abundances from catalogues of 
metal-poor stars, metal-rich stars, open clusters and also of 
stars in the Fornax dSph galaxy. The PCA included detailed 
simulation of the effects of observational errors on the appar¬ 
ent dimensionality of the C-space. The outcome is that the 
HERMES C-space is predicted to have dimensionality of 7 
to 9 for all these samples, likely depending on the dominant 
nucleosynthetic processes in a given population. 

The principal components are vectors in the C-space 
of the element abundances [X/Fe], and these vectors could 


X 

V 



Figure 6. The number of clusters recovered from a simulated 
GALAH survey as a function of initial cluster mass, fpop is the 
fraction of stars in a survey that belong to a given population 
and N is the number of survey stars. The green shaded region 
represent the GALAH detection limits. The dark blue and light 
blue shaded regions are the ranges of initial cluster mass inacces¬ 
sible to surveys with 500,000 stars and 250,000 stars, respectively. 
The red dots show the cluster masses from which we expect to 
recover 20 stars (points on the left) and 40 stars (points on the 
right) in a million-star survey, and the number of such clusters 
we would expect to find if Xmax is 2 x 10® (points on the blue 
curve) or 1x10® (points on the green curve). The red error bars 
show 2a Poisson uncertainty on the number and on the size of 
the recovered groups. 


be identifiable with nucleosynthetic processes. The principal 
components are eigenvectors of the correlation matrix, and 
are all orthogonal in the C-space; therefore the higher com¬ 
ponents are projections on hyperplanes normal to the more 
prominent components. The number of significant principal 
components is similar for metal-rich and metal-poor stars, 
but the actual components are different, potentially reflect¬ 
ing different dominant nucleosynthetic processes for each 
sample. The interpretation of the first principal component 
(the component with the largest eigenvecto r) is usually clear , 
but it is not so obvious for the others (see lTing et al.l[2012l . 
for a detailed discussion). For example, for the sample of 
low-metallicity stars with —3.5 < [Fe/H] < —2.0, the first 
principal component includes all of the neutron capture el¬ 
ements and the a-elements. It is probably related to core¬ 
collapse SNe producing a-elements plus the r-process contri¬ 
bution to neutron capture elements. The second component 
shows an anti-correlation of a-elements with Fe-peak and 
neutron capture elements, and may be related to ’’normal” 
core -collapse SNe whi ch do not contribute to the r-process. 

iTing et"^ ||2012| ~) also compared the G-space for open 
clusters, which have Galactocentric radii from 6 to 20 kpc, 
with the C-space for metal-rich stars in the solar neighbour¬ 
hood. The G-space for the clusters has one more dimension 
than the stars near the Sun. We may find that the giants 
in the GALAH sample, which extend out to about 5 kpc 
from the Sun, will also have a larger G-space dimension like 
the clusters. Note that the examples discussed above, the 
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PC A components are based on samples of a few hundred 
stars which had not always been homogeneously analysed. 
The homogeneously observed and analysed GALAH sample 
will help to delineate the nature of the principal compo¬ 
nents more clearly, and provide insight into the nucleosyn¬ 
thesis processes that lie behind the chemical evolution of the 
Galaxy. Using PGA techniques on such a large sample of 
stars, it may be possible to isolate the contributions of the 
various nucleosynthetic processes in each Galactic compo¬ 
nent, as a function of position and velocity, or as a function 
of the orbital integrals. 

However the mathematical principal components may 
not necessarily correspond directly to astrophysical pro¬ 
cesses. They may be just residuals of correlations associated 
with the orthogonality of the principal components. For a 
clearer interpretation of the structure of G-space in terms of 
physical galactochemical evolution models, we need differ¬ 
ent analysis techniques t hat are based on physical processes 
fe.g. lWest fc Hegenl2013l '). The group and cluster finding al¬ 
gorithms discussed in the next section are a first step in this 
direction. 


6.3 Group Finding 

The end goal of the GALAH survey is to use the abundance 
data, complemented by phase space information from Gaia 
and GALAH, and age data from Gaia, to implement the 
recovery of relic systems via chemical tagging. The algorithm 
development of identifying relic groups from the GALAH 
chemical inventory is still in its infancy, but with some recent 
posi tive demonstrations. 

iMitschang et ahl ll2013h compiled a sample of open clus¬ 
ter data from the literature, ending with a sample of 2775 
individual abundance values from 291 stars from 22 sources, 
all at R >24,000 to mimic the HERMES resolving power. 
Using this data IMitschang et al.l ll2013ll presented a metric 
to quantify the chemical difference of a pair of stars and de¬ 
rived an empirical probability function, which describes the 
probability that two stars observed with a number of similar 
abundances are compatible with being formed in the same 
star formation event. However the authors note the derived 
function is based on the inhomogeneous data available at the 
time and cluster reconstruction in a chemical tagging exper¬ 
iment of large scale should be calibrated against confirmed 
disp ersing clusters. 

IMitschang et ahl (l2014l l applied the above mentioned 
methodol ogy in a first blind chem i cal ta gging experiment 
using the iBensbv. Feltzing fc OevI (l2014l i sample of about 
700 field star abundances. They reported the identification 
of many chemical groups and highlighted that this metric 
based method is more likely to be grouping stars with a 
similar evolutionary history rather than representing debris 
of relic groups from a common parent proto-cluster. The ex¬ 
ercise also confirmed that ages for chemically tagged groups 
can be derived to high accuracy via isochrone fitting to the 
location of the tagged stars in a colour-magnitude diagram 
(GMD). This approach is a considerable improvement com¬ 
pared to deriving ages for single stars from their GMD loca¬ 
tion. 

Other group finding algorithms under development 
for exploiting the GALAH chemical inventory include 
the EnLink density-based hierarchical group finding code 


dSharma fc JohnstonI l2009h , Gaussian Mixture Models to 
separate groups based on their location in the chemical space 
and Bayesian Networks as a scheme to address any missing 
abundance data in cases of abundance analysis issues (Ting 
et al. in prep). 


6.4 Chemical tagging in the inner Galactic disc 

Although young open clusters are present in the inner 
Galaxy, the old (> 1 Gyr) surviving open clusters lie mostly 
in th e outer Galaxy, beyond a radius of about 8 kpc llFriell 
Il995h . The absence of old open clusters in the inner Galaxy 
is usually attributed to the stronger disruptive influence of 
the Galactic tidal field and interactions with giant molecular 
clouds in the inner Galaxy. This suggests that we may ex¬ 
pect to find the relics of many disrupted open (and globular) 
clusters from the GALAH giants in the inner disc. The in¬ 
ner Galaxy may also contain the debris of disrupted globular 
clusters. The 0-Na anti-correlation and other light element 
abundance patterns are unique to and ubiquitous in Galac¬ 
tic globular clusters, and will help to identify the debris that 
comes from globular clusters. We expect about 200,000 sur¬ 
vey giants to lie in the inner regions of the Galaxy. 

Disrupted clusters will provide a strong test of the im¬ 
portance of radial mixing for the evolution of the disc. Open 
clusters are assumed to be on near-circular orbits when they 
are young. In the absence of radial mixing effects, their dis¬ 
persed debris would still be on near-circular orbits and be 
confined to a fairly narrow annulus around the Galaxy. On 
the other hand, the influence of radial mixing would be to 
spread the phase-mixed azimuthally dispersed debris of in¬ 
divid ual clusters over several kpc in radius llRoskar et ahl 
I2OO8II . In this way, the radial extent of the chemically tagged 
debris of disrupted clusters of various ages will give a direct 
measure of how important radial mixing has actually been 
to the evolution of the Galaxy over cosmic time. 

In this environment of disrupted clusters, we may find 
chemical streams delineated by the debris of globular clus¬ 
ters that were in relatively eccentric orbits, or by the de¬ 
bris of open clusters that have been strongly affected by ra¬ 
dial migration. It may be possible to expose such chemical 
streams by using smooth chemical ev olution models (e.g. 
iGhiappini. Matteucci fc Romanol |2001^ to subtract off the 
background abundance gradient in different elements w ithin 
the framework of the Galaxia tool llSharma et al.l[201lll . 


6.5 Origin of the Thick disc 

The origin of the thick disc is a matter of ongoing 
debate (e.g. |Abadj__et_^ |200j; |yjIlaIobos_J^JHelmi| |200^ 


Brook et al.ll2005l : iLoebman et al. 201ll : Bow. Rix fc HoeeJ 

2 OI 2 II . with suggestions that it was accreted, heated from 
the thin disc by interactions with satellite galaxies, formed 
in situ, built through radial migration and is a monotonic 
thickening of the disc. From extragalactic studies, we know 
that thick discs are common in spiral galaxies, with varying 
degrees of kinematic connection to the ir corresponding thin 
discs (e.g. lYoachim fc Dalcanton|[2008ll and a potential cor- 
respondence to mas sive star clusters in redshift ^2 galaxies 
llGenzel et al.l[2C)ll^ . 

In order to understand the formation history of the 
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thick disc component, we need to accurately measure its 
defining properties. The thick disc is not observationally 
well constrained as the spatial and kinematic overlap with 
the Galactic thin disc makes it difficult to distinguish 
the thick disc stars in small samples of data within the 
solar neighbourhood. Thick disc properties including its 
scale length, dynamical discriminators such as velocity dis¬ 
persion and rotational lag as well as detailed chemical 
properties e.g. whether or not the thick disc exhibit any 
radial or vertical metallicity gradient, continue to be a 
major topic of dis cussion. However high-resolution abun¬ 
dance studies (e.g. iFuhrmannI l2008l : iHavwood et al.l l2013l : 
iBensbv. Feltzing fc Oevll2014 l show that thick disc stars are 
enhanced in a-elements relative to thin disc stars of similar 


[Fe/H]. 

With over 20% of the sample expected to belong to the 
thick disc, GALAH will provide a sample many orders of 
magnitude larger, extending radially and vertically over a 
few kilo parsecs to provide a direct estimate of the thick 
disc metallicity gradients as well as detailed chemical abun¬ 
dances, providing insights into which of the possible origin 
scenarios or perhaps which combination of scenarios, is the 
best explanation for the present-day thick disc. It will be 
possible to use the combination of kinematics and chemical 
tagging of the thick disc to evaluate which of the possible 
origin scenarios or perhaps which combination of scenarios, 
is the best explanation for the present-day thick disc. 


6.6 Ancillary Science 

The GALAH sample of a million stars will have uniformly 
reduced and analysed high-resolution spectra, accurate ra¬ 
dial velocities and accurate parallaxes and proper motions 
from Gaia. In addition to the primary motivation for Galac¬ 
tic archaeology, the GALAH data will be an invaluable long 
term resource for a huge range of stellar and interstellar sci¬ 
ence. We briefly discuss some of this GALAH science; the 
discussion is by no means exhaustive. 


6.6.2 Accurate stellar masses for binary stars 

Accurate stellar masses are scarce, where only 95 detached 
binary syste ms have both the mass and radius m easured to 
3% or better ^Torres. Andersen fc Gimenejl2oT^ . Such sys¬ 
tems are important, as their components can be expected to 
have evolved as if they were single stars. The Gaia mission 
will derive accurate astrometric orbit s of the primary c ompo- 
nent with an orbital period <1 year llSoderhielmll200 . This 
fixes the primary orbital period, inclination and size, but not 
that of the secondary orbit. The latter will not be measured 
by Gaia, as the instrument actually measures the motion 
of the photocenter, which in unequal binaries roughly coin¬ 
cides with the primary orbit. GALAH is in a unique position 
here, as it is the only survey that will obtain sufficiently high 
signal-to-noise and wide wavelength coverage to detect the 
spectrum of both binary components and measure their ve¬ 
locity separation. A single GALAH observation of such a 
system with an astrometrically determined inclination and 
orbital phase will yield the major axis of the relative orbit 
of the two components, enabling us to determine masses of 
both components. Such a combination of Gaia astrometry 
and GALAH spectroscopy will enable very accurate mass de¬ 
terminations for orders of magnitude more binary systems 
than currently known. 


6.6.3 Active stars 

Recent survey observations of open clusters (such as RAVE 
and Gaia-ESO) find that large fractions of open cluster stars 
(up to 15%) are active, mostly in various pre main-sequence 
phases of stellar evolution. Active stars are also observed in 
the field, where chromospherically active stars are second 
only to spectroscopic binaries among the peculi ar stars from 
RAVE (iMatiievic et al.ll2012l : IZerial et akfeoi^ l . GALAH is 
uniquely capable of identifying activity using the spectral 
profiles of both Hq and H/3. Such data will help verily activ¬ 
ity detection in marginal cases and tests the physics of the 
emission region via Balmer decrement. GALAH data will 
also be employed to investigate activity effects on spectral 
lines and abundance determinations. 


6.6.1 Dynamic interstellar medium 

Interstellar absorption lines give unique insight to the prop¬ 
erties of the interstellar medium. In addition to a few atomic 
lines (e.g. Gall H & K, Nal D1 & D2, KI), the optical range 
also contains several hundred weak diffuse interstellar bands 
(DIBs) th o ught to be produced by organic macromolecules. 
IKos et ^ (l2014h found that the distribution of DIBs north 
and south of the Galactic plane is not equal based on RAVE 
spectra: the unknown DIB carrier population is not well 
mixed and must have been consequently perturbed fairly re¬ 
cently. A three-dimensional analysis of the spatial strength 
and radial velocities of DIBs leads to a 4-dimensional kine¬ 
matic picture of the interstellar medium, pinpointing ISM 
bubbles arising from supernovae and other stellar explosions 
a few million years in the past. While this study is based on 
only ~10000 independent sightlines from RAVE, GALAH 
with its higher signal-to-noise and higher spectral resolu¬ 
tion and larger sample size will be able to recover a better 
spatial sampling. Figure 0 shows preliminary data on DIBs 
and KI line as observed by GALAH. 


6.6..i Young stellar associations and open cluster 
disruption 

The presence of lithium is a strong indicator of youth for 
pre-main sequence st ars of late-G to early-M spectral types 
^Mentuch et al.l[2008h . as lithium is fragile and is easily de¬ 
stroyed. Chromospheric and other types of pre-MS activity 
are also confined to young stellar populations. By examin¬ 
ing the Li abundances and also emission cores in Ha and 
H/3 profiles, in combination with existing proper motions 
and the ~0.5 kms“^ velocities produced by the GALAH 
pipeline, we anticipate the discovery of hundreds of new low- 
mass members of young associations. This information may 
also enable identification of stellar ejecta from still bound or 
recently decomposed open clusters. 

6.6.5 Li-rich giants 

All phases of canonical stellar evolution should destroy 
lithium, but a small fraction of post-main-sequence stars 
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Figure 7. Preliminary data on diffuse interstellar bands and on the K I interstellar atomic line at 7699A as observed by GALAH. 
Spectrum of TYC 4011-102-1, a hot star with str ong interstellar absorptions close to the Galactic plane, is shown. Each 20A wide 
panel is centred on the DIB wavelength as listed in ijenniskens Desert! llOOj)- Plotted wavelengths are heliocentric. Right-most panel 
identifies two interstellar clouds for K I at different velocities. For a majority of GALAH objects which lie away from the Galactic plane 
such complications are rare (but can be detected). 


have Li abundances higher even than primordial Big Bang 
Nucleosynthesis levels. In a survey as large as GALAH, we 
expect to find many more Li-rich giants, leading to a more 
systematic understanding of their properties and the source 
of their lithium enrichment. 


6.6.6 Magellanic Clouds 

GALAH does not specifically target the Magellanic Clouds, 
but neither are they avoided. Stars with absolute magni¬ 
tudes brighter than Mv = —4.5 and —5.0, at the distance 
of the Large and Small Magellanic Clouds respectively, will 
fall within the GALAH apparent magnitude selection. Such 
stars have been tentatively identified in the data taken to- 
date. These mostly young objects will provide information 
related to their star formation sites and help identify a co¬ 
herent chemical tag. Such data will also allow investigations 
into whether these stars are ’’runaway” stars and explore 
the radial extent of the Large and Small Magellanic Clouds. 
There is also potential to identify luminous blue variables 
(supernova precursors) in the Large Magellanic Cloud. 

6.6.7 Globular clusters 

Whil e recent work (e.g. iMilone et al.l[201^ : ICarretta et al.l 
l2014h has firmly established that the stars within Galactic 
globular clusters can be separated into two or more distinct 
populations using particular elemental abundances or care¬ 
fully chosen photometry, the chemical tagging approach will 
allow a more integrated search for population complexity 
within globular clusters. Also stars that have escaped from 
globular clu sters can be identified i n the field by chemical 
tagging (e.g. lMartell fc Grebelll201ol ). St ars in globular clus - 
ter tidal tails or extra-tidal ’’halos” (e.g. lMarino et aT]|2014l ) 
will have chemistry, radial velocity and stellar parameters all 
in common with stars still in the cluster, allowing a study 
of the internal and external processes driving cluster mass 
loss. 

Globular clusters have long provided vital constraints 
on stellar evolution theory. When GALAH abundance in¬ 
formation is combined with (i) the globular cluster multiple 
population phenomenon and (ii) the asteroseismic informa¬ 
tion from the Kepler K2 mission (Section 15.41 K2 fields also 
contain globular clusters), it is clear that a new era of pre¬ 
cision stellar astronomy will be upon us. This will make 


great demands of stellar theory, forcing substantial improve¬ 
ments in the models. Conversely, the observations will serve 
as very strong multidimensional constraint s on theory (e.g. 
iBenornar et al.l[2014l : ICampbell et al. lliH). 

6.6.8 Metal-poor stars 

Metal-poor halo stars pr ovide information on star forma¬ 
tion at early epochs (e.g. ISuda et al.ll2013l ). and may even 
show the chemical pollution of Population III supernova, 
constraining the proper t ies of the first generation of stars 
llHeger fc Wooslell2010l . l2002l) . e.g., whether we may find 
single stars or binary stars, or how fast the stars rotate, 
or whether t here was even a significant contri bution from 
very massive (lAoki et al.ll2014l) or supermassive (IChen et al.l 
l2014h primordial stars. We estimate that ~ 1% of GALAH 
stars will be halo stars (Table [T]). Of t hese ^ 20% ma y be 
metal poor with [Fe/H]< —2.0 (see e.g. lAn et al. Ilmi). 

Whereas the lowest-metallicity halo stars (e.g. 
ICaffau et al.ll20rih are likely uniquely explained by coming 
from a very early generation, one or a few supernovae highly 
diluted by primordial gas, most of the metal likely end up 
in stars that may be less extreme in terms of absence of 
metals. Several halo stars of low [Fe/H] have a significant 
enhancement of C NO relative to Fe, though their very 
low Fe abundance ( Keller et al.l l2014l : iFrebel et al.l 120051 : 
IChristlieb et al.l 1200^ ) makes it hard to explain these stars 
as coming from much later epochs due to the increased Fe 
’pollution’ of the gas. The metal from Pop HI stars may 
have ended up in stars of [Fe/H] ~ —2 or even above, and 
may be e ven in the bulge or thick disk ra ther than the 
halo (e.g. lAoki et al.l 12014 : iTumlinsonI |201(]| ). Those stars 
are rare and much harder to find than the spectacularly 
iron-deficient stars. The large sample of stars from GALAH 
will increase the chances of identifying those rare stars 
and targeted follow-up observations can verify them as 
descendent of Population HI stars. 

6.6.9 Characterising exoplanet hosts 

HATSouth is a wide-field photometric s urvey of the south - 
ern hemisphere for transiting exoplanets (iBakos et al.ll201^ ). 
It monitors stars from multiple southern sites in 128 square 
degree fields with 4 minute cadence in the sloan-r band. 
HATSouth photometry is optimal for stars in the same mag¬ 
nitude ranges as the GALAH survey (10<V<14), and there- 
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fore there will be a high degree of overlap between GALAH 
target stars and stars with multi-month light curves gen¬ 
erated from the HATSouth survey. The measured depth of 
an exoplanet transit provides the planet’s radius as a func¬ 
tion of the host star’s radius. Stellar parameters of the host 
star from the GALAH pipeline will aid in determining accu¬ 
rate radii of transiting planets and whether they lie within 
the habitable zone. The GALAH abundances will provide a 
means by which to test if the stars identified by HATSouth 
as hosting a close-in transiting planet differ in abundance 
pattern from the more general Galactic stellar population. 
By cross-matching our GALAH target stars with the HAT¬ 
South light curves we will be able to determine time-variable 
photometric information for target stars, which will reveal 
pulsators, eclipsing binaries, and highly active stars in the 
set of GALAH targets. 


6.6.10 Extinction maps 

Accurate spectroscopic parameters from the GALAH 
pipel ine, in combination with available photome tric informa¬ 
tion (ISkrutskie et al.ll2006l : iHenden et al.ll2012ll . will allow a 
detailed investigation of the shape of the interstellar extinc¬ 
tion curve, which is well-documented to be n on-standard 
and n on-uniform for many Galactic sightlines (iNataf et al.l 
l2013h . GALAH data will aid in improving three-dimensional 
maps of the extinction in the nea rby Galactic disc (e.g. 
ISale et ^I20l4 iMunari et al.ll2ni4l L providing a major im¬ 
provement over standard integrated reddening maps. 


7 CONCLUSION 

The primary science driver for the HERMES instrument was 
to conduct a Galactic archaeology survey of a million stars 
to carry out the chemical tagging experiment. HERMES 
brings together a unique combination of high-resolution, 
multi-wavelength and multi-object capability over a large 
2 degree held of view and its successful commissioning on 
the AAT has enabled the start of the GALAH survey. 

With a data set signihcantly larger than any other on¬ 
going Galactic archaeology survey and a strong focus on the 
disc components of the Galaxy, the GALAH dataset will be 
well-sampled across the high-dimensional abundance space. 
GALAH will be able to thoroughly map the star-formation 
and chemical enrichment history in a broad annulus around 
the disc extending in and out from the solar radius by kpc 
in Galactocentric radius for dwarfs and ~5 kpc for clump 
giants. The fundamental goal of GALAH is to recover the 
original building blocks of disc assembly in order to generate 
a detailed physical picture of how the Milky Way and other 
spiral galaxies formed and evolved. 

In addition to the overarching goal of chemical tagging, 
the GALAH survey science drivers span a broad range across 
stellar and interstellar astrophysics, which are complemen¬ 
tary to Galactic archaeology research. The GALAH collab¬ 
oration look forward to providing a data set of enormous 
legacy value, with superior spectral resolution, abundance 
precision and sample size. The coming years will be an ex¬ 
citing time for stellar studies of the Galaxy. 
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